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Abstract 

The Classification Literature Automated Search Service, an annual 
bibliography based on citation of one or more of a set of around 80 book 
or journal publications, ran from 1972 to 2012. We analyze here the years 
1994 to 2011. Service, as it was termed, is produced by the Classifica- 
tion Society. In earlier decades it was distributed with the Journal of 
Classification. 

1 Introduction: 40 Years of Service, the Classi- 
fication Literature Automated Search Service 

The Classification Literature Automated Search Service started with Volume 1 
in 1972 (using the previous calendar year's data). It was published as a printed 
booklet from the start and bore the ISSN 0731-4043. It was necessary in the 
editorial and printing team to plan carefully the total number of printed pages, 
to liaise with Springer's headquarters and distribution center (these were in 
New York City and in Secaucus, New Jersey) and the printing company used 
(Sheridan Press, Pennsylvania) and then to have copies of the bibliography 
shipped with the Journal of Classification to subscribers and also to libraries. 

From June 1984 through May 1993, the Editor of Service, or sometimes 
CSNA Service as it was referred to, was William H.E. Day. During that time 
the Technical Editors were Elaine Boone (1984-1988) and Eva Whitmore (1989- 
1993). Bill Day also had the assistance of Todd Wareham, a computer science 



student then, in the preparation of camera-ready copy for volumes 14-16 (1985- 
1987). Technical Support included use of C programs and Unix scripts for 
reformatting the data. Bill Day was based in the Department of Computer 
Science, Memorial University of Newfoundland, St. John's, Newfoundland. 

The data was obtained from ISI, the Institute for Scientific Information, 
which is now a subsidiary of Thomson-Reuters (and publishes the World of 
Science, the Science Citation Index, the Social Science Citation Index, and 
other products). Processing of the bibliographic data was always required. In 
the early years, a range of nroff and troff text processing utilities were used to 
re-format the data. Due to limitations on the output and distributed format 
(book, later diskette, then CD), various algorithms were applied to restrict the 
quantity of data. This included filtering by listing journal titles, and keywords 
to be excluded in titles of published articles. Thus, in the latter case, medical 
terms, or "galaxy cluster" , betokened non-algorithmic matters and hence were 
to be excluded. 

Before Bill Day, Roger K. Blashfield (University of Florida) was Editor. 
Fionn Murtagh was Editor from 1993 to 2008. Michael Kurtz was Editor there- 
after. Eva Whitmore remained as Technical Editor, having started as noted 
above in 1988. 

In the 1990s the bibliography went to diskette format and there too we 
rapidly went to the storage capacity of the media at that time - 5.25 inch "flop- 
pies" that were to be replaced by 3.5 inch diskettes. It made sense then, as 
announced by us in the production team in October 1999, to transit to CDs, 
which additionally allowed us, due to the storage available, to have previous 
years' bibliographies, and then to have scanned copies of "profile" books avail- 
able on the CD. Below it is explained just how the profile of books and articles 
was used to drive the retrieval process and thereby to define the domain of 
interest. 

In 1994, on-line content search to the bibliographies was supported by the 
WAIS, Wide- Area Information System, distributed search and retrieval stan- 
dard. This was an early forerunner of the search engines to come a few years 
later. 

As either book or as CD, the bibliography, that was termed Service, was dis- 
tributed as a supplement to the Journal of Classification, published by Springer 
on behalf of what is now called the Classification SocietjQ The Classification 
Society was set up in April 1964. In December 1968, European and North 
American branches were set up, and were largely autonomous. These branches 
became the Classification Society of North America (CSNA), and the British 
Classification Society. In 2008, CSNA reverted to its former name of the Clas- 
sification Society. (Various historical documents can be found on the web sites 
of the Classification Society, and of the British Classification Societjj^J) The 
Journal of Classification saw its Volume 1, Number 1, in 1984, and its first 
Editor-in-Chief was Phipps Arabie (born 13 March 1948, died 23 June 2011). 

1 http: / /www. classification-society.org/clsoc 
2 Currcntly http:/ /thames.cs.rhul.ac.uk/bcs 
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Figure 1: A typical cover of the printed volume of Service. 

Phipps Arabie was a very strong supporter always of the Service bibliographies. 
From 2002 to date (2012), Willcm Hciser is Editor-in-Chief of the Journal of 
Classification. 

In 2008, the last CD was produced, due to the plan be be web-based only. 
In 2012, ISI discontinued the provision of data completely. For online access 
now to Service, see https: //www. ci a. harvard, edu/^kurtz 

The following] is from the introduction to Volume 23, 1994. It explains the 
mechanism used to carry out the searches and to assemble the bibliographies. 

"This volume of the Classification Literature Automated Search Ser- 
vice contains a bibliography of 2497 classification-related journal pa- 
pers which appeared in 1994. In order to use the Service knowledge- 
ably for reference, readers should know about the databases from 
which the journal papers were selected, the criteria employed to 
identify classification-related papers, and the mechanisms provided 
to access bibliographic information about classification-related pa- 
pers. 

The first step in constructing the bibliography is to collect data 
about journal papers. The Service obtains these data from Research 
Alert, a bibliographic service of the Institute for Scientific Informa- 
tion (ISI, Philadelphia). Research Alert enables the Service to access 
papers in over 8000 science, technical, and social sciences journals 
including those from the Science Citation Index, Social Sciences Ci- 
tation Index, and Arts Humanities Citation Index databases. To 
use Research Alert, the Service's Editorial Board compiles a list, or 



'Currently available as file service23.profile.txt at address ftp://ftp.pitt.edu/group/csna 
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profile, of books or papers, called profile items, that are indicative 
contributions to the theory and practice of classification. A journal 
paper is considered to be classification-related if it cites one or more 
profile items. Using the profile, Research Alert provides the Service 
with bibliographic information about classification-related papers in 
recent issues of the 8000 journals it processes. 

Research Alert's selection procedure depends completely on the pro- 
file. The Editorial Board reviews the profile regularly in order to 
ensure that the papers selected by Research Alert are relevant to 
classification and related areas of data analysis. The profile for this 
volume of the Service contained 82 items and appears in the file 
'profile.txt'. The Editor welcomes your suggestions for improving 
the composition of future profiles." 

2 The Data: The Bibliographies of Clustering 
from 1994 to 2011 

2.1 The Profile Publications Used to Drive the Search and 
Retrieval 

The "profile" publications used in the past few years are listed in the Appendix. 
Citing any one of these publications was therefore the criterion used for assem- 
bling the annual bibliography. 

The number of citations per year is shown in Table [T] This relates to the 
bibliographies for the years 1994 to 2011. 

It is to be noted how some of the profile publications were introduced in a 
given year. (Consider, for example, Blashfield76, introduced from 2004.) 

In the case of, for example, Bishop95, in pre-1995 years, the search term 
used here ( "BISHOP CM" ) picked him up as an author of another publication 
and not the profile publication, his 1995 book. 

Note too that the (different) works of some authors are combined by us. 
Such is the case for example for two publications by Doug Carroll, published 
in 1970 and in 1980. (J. Douglas Carroll, 1939-2011, worked most recently 
at Rutgers Business School. He was an early developer of, and founder of the 
field of, multidimensional scaling and other methods and their applications in 
psychometrics . ) 

2.2 Changed Data Provision After 2003 

What is particularly noticeable about Table [T] is the increase in citations over- 
time. See Figure [2] While it is the case that (i) there was some net increase 
each year, but (ii) nonetheless the lack of constraint related to distribution 
medium from 2004, (iii) that can be coupled with the massively growing volume 
of research production worldwide, and finally (iv) the high point of 2009, maybe 
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Table 1: Frequencies of occurrence found for the 82 profile publications, over 
18 years. There are 135,088 bibliographic citations in all, i.e. the grand total of 
this table. 
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Based on 82 'profile' publications 
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Figure 2: Increase of citing bibliographic entries over the years. 

given a lag to be expected in publishing following the economic downturn in the 
Western countries that started in 2008. Among other changes from 2004, see 
how Sneath73 is replaced largely (cf. Table [l]) by the earlier jointly authored 
volume, Sokal63. (P.H.A. Sneath, University of Leicester, UK, 17 November 
1923 - 9 September 2011, and R.R. Sokal, State University of New York, 13 
January 1926 - 9 April 2012, were key names in the development of numerical 
taxonomy.) 

Table[2]is an alternative view of the 18 years we are dealing with. This table 
shows a range of discipline names that are picked up in the data by their appear- 
ance in a journal title, or a publication title, or an area title. The terms used 
are: Medicine, Biology, Physics, Chemistry, Astronomy, Mathematics, Statis- 
tics, Engineering, Psychology, Psychiatry, Literature, Humanities, Economics 
and Sociology. 

From the editorial report of Service to the Board of the Classification Society 
of North America in June 2003, there is the following explanation of the sea- 
change in the bibliographic source data from 2004 onwards. 

"The CD containing Service data, cumulative over a number of 
years, with a Java graphical user interface, and copies of Hartigan's 
(scanned) and van Rijsbergen's classical books, was distributed as 
usual with issue 1 of the Journal of Classification. Number of copies 
produced 525. ... 

Up to now, Eva got the data quarterly and initially processed it at 
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2 


1 





15 


12 


6 


12 


10 


13 


10 


9 


Lit 


2 


1 


1 


1 


3 


1 


3 


2 


1 


5 


9 


9 


8 


4 


3 


7 


9 


10 


Hum 


9 


19 


15 


15 


18 


14 


23 


17 


19 


36 


1 





1 


1 


4 


4 


7 


4 


Eco 














2 








2 


1 


1 


26 


29 


33 


30 


40 


53 


45 


42 


Soc 


21 


24 


27 


16 


25 


17 


17 


24 


24 


20 


7 


2 


5 


16 


2 


13 


11 


12 



Table 2: Frequencies of occurrence found for the discipline terms - used in 
journal or article titles or otherwise. The columns are the years 1994 to 2011. 
See text for the spelling out of the disciple labels. The number of occurrences 
here is, in all, 23,997. 



Memorial University. (A long time ago Bill Day there was the link 
with Memorial). Probably the scripts in use there are 15 years old, 
or more. Then I did some processing, with a number of Unix scripts. 
For the CD, a Java application based search GUI was written 2 or 
3 years ago, and of course assumed the particular format discussed 
above. Now ISI, from whom we purchase the data (about USD 70 
per profile item) arc changing the dissemination mechanism and the 
format. ... 

... our new format for receiving data from ISI. [ISI] emailed me about 
a week ago and informed me that "Research Alert" data will no 
longer be available - they are switching totally to "Personal Alert" , 
as below. We get the same data, in a weekly email, but as you can 
see, the format is different. 

[...] 

I notice this data has keywords associated." 
Tables [3] and 0] are indicative of these formats. 
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T Learning to Set-Up Numerical Optimizations of 

T Engineering Designs 

A SCHWABAC . M 

A ELLMAN T 

A HIRSH H 

K MATHEMATICAL SCIENCES - Computer Science 

U A I EDAM 12(2): 173-192, APR 1998 

W M Schwabacher, Natl Inst Stand & 

W Technol, Gaithersburg, MD 20899 

W. BREIMAN L 84 



Table 3: "Research Alert" format for the bibliography data used up to 2003. 
First entry of Volume 27, 1998. AI EDAM is the Cambridge University Press 
Artificial Intelligence for Engineering Design, Analysis and Manufacturing. The 
"profile" entry is Breiman, Friedman, Olshen and Stone, Classification and Re- 
gression Trees, Wadsworth, 1984. 



TITLE: 



AUTHOR: 



SOURCE: 



Multiscale spatial variation of the bark beetle Ips 

sexdentatus damage in a pine plantation forest (Landes de Gascogne, 

Southwestern France) (Article, English) 

Rossi, JP; Samalens, JC; Guyon, D; van Haider, I; 

Jactel, H; Menassieu, P; Piou, D 

FOREST ECOLOGY AND MANAGEMENT 257 (7). MAR 22 2009. 
p. 1551-1557 ELSEVIER SCIENCE BV, AMSTERDAM 



SEARCH TERM(S) : RIPLEY BD rauth; DENSITY ESTIM* rwork; MULT I* rwork 



KEYWORDS: Bark beetle; Ips sexdentatus; Pinus pinaster; Spatial 

statistics; Ripley's statistic; Aggregation; Landscape; 
Plantation forest 

KEYW0RDS+: POINT PATTERN-ANALYSIS; TYPOGRAPHUS L. ; FELLED TREES; 

SPRUCE; SC0LYTIDAE; C0LE0PTERA; MANAGEMENT; WINDTHR0W; 
RISK; COLONIZATION 

AUTHOR ADDRESS: JP Rossi, INRA, UMR BI0GEC0, Domaine Hermitage 69 Route 
Arcachon, F-33612 Cestas, France 



Table 4: "Personal Alert" format for the bibliography data used from 2004. First 
entry of Volume 38, 2009. The "profile" item is B.D. Ripley, Spatial Statistics, 
Wiley, 1981. 
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3 Sematic Analysis of Profile Publications and 
of Disciplines, over 18 Years 

3.1 Semantic Analysis 

Take the observables, e.g. profile publications, or disciplines, as indexed by i. 
Take the attributes, e.g. the years, as indexed by j. Call the mass of observable 
i to be fi, and analogously the mass of attribute j, fj. These masses are 
components of marginal distributions. Alternatively expressed, the fi and fj 
terms, for all i and j, are respectively the empirical probability distribution 
defined on the set of all observables, i, and on the set of all attributes, j. The 
domains of the function / are thus, respectively, the observables set and the 
attributes set. 

The frequency of occurrence data used for observable i and attribute j is 
fij . Correspondence Analysis is firstly and foremostly the study of discrepancy 
of fij from a sort of null hypothesis expressed by fifj. 

A successively best fit Euclidean representation is found, to embed the ob- 
servable set, and the attribute set. Let the observable i have embedding, firstly, 
and then, secondly, projection relative to factor ip, and similarly for attribute 
j relative to factor cf>. The associated eigenvalue of the pair of factors ipi and </> 
is A. 

We require the semantic relationship tying together observables and at- 
tributes vis-a-vis each successive factor: 

VXipi — fy&j an d in the dual space VX(pj — fy-ipi 

3 h i fj 

These are termed transition formulas. 

Supplementary elements, rows or columns, are when we use fij values that 
are, through these relationships, projected post hoc into the analysis. 

The semantic analysis framework is now used to provide (1) visualization, 
seeking particular salient interrelationships in the data, and (2) summarization 
of the data through clusters, where we use years, disciplines and publications 
to achieve a good understanding of the data. Here (1) is a planar, and hence 
low-dimensional, expression of the data, whereas in (2) the clustering is carried 
out in the data's full dimensionality. 

3.2 Major Change: Pre-2004 and From 2004 Onwards 

The profile publications, as seen in Table [l] contain inclusions and withdrawals, 
and also data source issues over which there was no control. So a more suitable 
analysis, because it was based on free text search and no more than that, is 
based on the discipline labels. Therefore our first analysis is of the frequency 
of occurrence data for 16 disciplines crossed by 18 years. Figure [3] shows the 
principal factor plane. Humanities (denoted Hum), is off to the left (on the 
positive side of the ordinate). 
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The major issue of note in Figure [3] is how one-dimensional the data is. 
In information content expressed by percentage of inertia explained by these 
principal axes, the first axis dominates. 

The (red in the original) lines connect the successive years, that are projected 
onto this principal factor plane. The clump of years around Math (mathematics) 
and Psych (psychology) are all the years 1994 to 2000; and then just north east 
of them is 2001, followed by 2002, and north west of it, 2003. They are all on 
the negative side of the first factor, in terms of projections on that factor. Over 
to the positive side of the first factor are the years 2004 through to 2011. The 
years that lie out a little to the north of the clump are 2011 and (the further 
away year) 2008. 

Can we say that Math and Psych, and Soc (sociology) are more typical of 
the earlier years in regard to cluster analysis research; and that if anything Mgt 
(management) is most typical of the later years here, in regard to computer 
science research? In order to address this, we ought to look at the full dimen- 
sionality of the data rather than just a 2-dimensional projection. This will be 
done below. 

Profile publications are shown as dots in Figure [3] (so as not to crowd the 
figure). Some are projected well off this figure. Because these are cited publi- 
cations rather than coming from a given discipline, let us look at them through 
the cluster analyses to follow now. 



3.3 Semantics Analyzed through Clustering of Years, Dis- 
ciplines and Publications 

In the following we use the Euclidean space, with equiweighted points, as pro- 
vided by the Correspondence Analysis. The points in this space are equi- 
weighted. Furthermore we use the full dimensionality Euclidean space. Since 
the active analysis was on the 16 disciplines crossed by 18 years data, the full 
dimensionality of the Euclidean factor space is min (16 — 1, 18 — 1), i.e. 15. In this 
15-dimensional space (illustrated by the planar projection in Figure [3] we thus 
have disciplines and years projected, and then as passive (or post hoc) elements 
we have publications projected. Because the projection takes full account of 



interrelationships as discussed in subsection 3.1 we have that years, disciplines 
and publications are all projected into the same space. 

Ward's minimum variance hierarchical clustering using Euclidean distances 
is an appropriate method to use. It is appropriate in the sense that it uses 
aggregation based on inertia (masses all identical) which dovetails with the 
inertia-based decomposition of the Correspondence Analysis. (This hierarchical 
clustering criterion was initially described by Joe H. Ward Jr., who died on 23 
June 2011, aged 84.) 

Figure [4] relates to disciplines and years. The very clear year-based divi- 
sion of the data is displayed by the two big branches in the dendrogram. We 
also have further support of the quite key role of Psych (psychology) and Math 
(mathematics), and others, in the early years; and the key role of Mgt (man- 
agement), Stat (statistics) less pronounced but present, and others, in the later 
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16 disciplines; 18 years (connected); 82 publications (dots) 




Figure 3: Principal factor plane of a Correspondence Analysis of (primary 
analysis) 16 disciplines crossed by 18 years; and projected into this (supple- 
mentary elements) 82 "profile" publications. (Medicine is very slightly reposi- 
tioned to avoid overlap. Humanities is off the figure to the left, at coordinates 
-1.168,0.251.) 
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Hierarchy (Ward criterion) of 18 years and 16 disciplines 



o 



ID - 




Years 1994 to 2011; 16 disciplines 



Figure 4: Hierarchical clustering of the years and the disciplines. (Relative 
horizontal positioning of terminals is for display purposes only.) 

years. 

In Figure [5j the 82 documents are also included. For discussion of clusters, 
we will use the labels shown in Figure [6j This allows us more easily to discuss 
the publications, and their associations with years and disciplines, in order to 
home in on major trends and patterns in this data. 

Cluster 1 (cf. Figure [6]): 

Disciplines: Med Phys Astr Eng Ecol Psy Lit Econ Mgt 

Years: 04 05 06 07 08 09 10 11 

Publications: Bezdek81 Blashfield76 Duda73 Everitt79,80 Fisher36 Fukunaga72 
Gordon81 Hand81 Hubert7685 Jain88 Kohonen95 Mayr69 McLach- 
lan88,92,97 Murtagh83 Pavlidis77 Rand71 Ripley81 Sokal63 

Cluster 2: 

Disciplines: Bio Chem Stat 



x 
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Hierarchy (Ward) of years, disciplines, publications 



CD 



18 years, 16 disciplines, 82 publications 



Figure 5: Hierarchical clustering of the years, the disciplines, and the profile 
documents. 
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Hierarchy (Ward} of years, disciplines, publications 



D 

in 



<3> 

1 8 











s 







Partition with 5 clusters with cluster labels shown 



18 years, 16 disciplines, 82 publications 



Figure 6: As Figure [5j where clusters of the 5-class partition are labelled. 
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Publications: Breiman84 Cormack71 Devijver82 Diggle83 Efron83 Eldredge80 Fried- 
man77 Fu74,82 Gower66 Hartigan75 Hennig66 Jardine71 Johnson67 
Kluge69 Kruskal64,78 Lance67 Legendre83 Lorr83 Mantel67 Milli- 
gan80,81,85 Nei72 Nelson81 Orloci78 Punj83 Reyment84 Sankoff83 
Silverman86 Spaeth80 Tversky77 VanLaarhoven87 Ward63 Wiley81 
Wolfe70 Zahn71 

Cluster 3: 

Disciplines: Math Psych Soc 

Years: 94 95 96 97 98 99 00 01 02 03 

Publications: Adams72 Anderberg73 Avise74 Benzecri73 Farris72 Felsenstein82 Fitch67 
Greenacre84 Guttman68 Hill74 Michalski83 Nosofsky84 Rohlf82 Sat- 
tath77 Schiffman81 Sneath73 Spitzer74 Swofford81 Wishart87 

Cluster 4: 

Discipline: Hum 

Publications: Arabie87 Carroll70,80 Cover67 Gauch82 Gnanadesikan77 Huber85 
Maddison.84 Rammal86 Sammon69 

Cluster 5: 

Publications: Bishop95 VanRijsbergen79 

From these clusters it can be seen how the "classical" period characterized 
by cluster 3 is counterposed to the "modern" period of cluster 1. 

The dominant disciplines of the "classical" period were Math, Psych and 
Soc (mathematics, psychology and sociology). Certainly some of the profile 
publications cited in the "classical" period come from ecology, phylogeny and 
even machine learning, but this is not a matter of their disciplines but rather 
cross-discipline influence. 

For the "modern" period, cluster 1, it is seen in the planar projection of Fig- 
ure[3]how Mgt, management, is very central. Other disciplines that characterize 
especially this cluster are noted above. The more influential profile publications 
can be read off too. 

Clusters 4 and 5 are broadly associated with the "classical" period. The 
pattern recognition and information retrieval profile publications of cluster 5 
are in tune with this (given the major ongoing role certainly from the 1960s of 
these sub-disciplines). 

Cluster 2, closest to the "modern" period, is characterized most of all by the 
disciplines of Bio, Chem, Stat, viz. biology, chemistry and statistics. See how 
in Figure [3j we would not have found that outcome from the planar projection 
alone. 
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4 Conclusions 



The 135,088 citations to one or more of the 82 profile publications have led 
us to find a major thematic shift in clustering research over the 18 years con- 
sidered here. At its most basic, this thematic shift is from the central role of 
mathematical psychology in the years 1994 to 2003, and then the central role 
subsequently of management. A trend of massive proportions has also been the 
annual increase in Service contents. Other less pronounced trends can be noted 
also. 

Cluster analysis has shown, and continues to show, great vitality in terms 
of responding to the challenges raised in many different disciplines. Vitality is 
both methodological and practical. 
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Appendix: The Profile Publications Used 



AUTHOR 


JOURNAL/BOOK TITLE 


VOL 


P. YR. 


ADAMS EN 


SYST ZOOL 


21 


390 


72 


ANDERBERG MR 


CLUSTER ANAL APPLICA 






73 


ARAB IE P 


3 WAY SCALING CLUSTE 






87 


AVISE JC 


SYST ZOOL 


23 


465 


74 


BENZECRI JP 


ANAL DONNEES 






73 


BEZDEK JC 


PATTERN RECOGNITION 






81 


BISHOP CM 


NEURAL NETWORKS PATT 






95 


BLASHFIELD RK 


PSYCHOL B 


83 


377 


76 


BREIMAN L 


CLASSIFICATION REGRE 






84 


CARROLL JD 


ANN R PSYCH 


31 


607 


80 


CARROLL JD 


PSYCHOMETRI 


35 


283 


70 


CORMACK RM 


J ROYAL STA A 


134 


321 


71 


COVER TM 


IEEE INFO T 


13 


21 


67 


DEVIJVER PA 


PATTERN RECOGNITION 






82 


DIGGLE PJ 


STATISTICAL ANAL SPA 






83 


DUDA RO 


PATTERN CLASSIFICATI 






73 


EFRON B 


J AM STAT A 


78 


316 


83 


ELDREDGE N 


PHYLOGENETIC PATTERN 






80 


EVERITT BS 


BIOMETRICS 


35 


169 


79 


EVERITT BS 


CLUSTER ANAL 






80 


F ARRIS JS 


AM NATURAL 


106 


646 


72 


FELSENSTEIN J 


Q REV BIOL 


57 


379 


82 


FISHER RA 


ANN EUGENICS 2 


7 


179 


36 



16 



FITCH WM 


SCIENCE 


155 


279 


67 


FRIEDMAN JH 


ACM T MATH 


3 


209 


77 


FU KS 


f-l -r fUTrp A /"IT" T /~~\ TI JTT — ' rpT T (~\ T\ T\ A 

SYNTACTIC METHODS PA 






74 


FU KS 


mruTT ft /it t /"i n A rpirpiT-ipi t.t np 

SYNTACTIC PATTERN RE 






82 


FUKUNAGA K 


T T.Trp<P\ r\ rp A rn T rp T /~1 A T T\ A 

INTRO STATISTICAL PA 






72 


GAUCH HG 


Tl AT TT rp T T T A PS T A TP A TIT A T /"I r~"l 

MULTIVARIATE ANAL CO 






82 


/I T A TlT A PVP" ■ fi TT7 A T.T 

GNANADESIKAN 


Ti IprPTTHRP) f~1 rpi A rp T r~1 rpi -p /I * T 

METHODS STATISTICAL 






77 


GORDON AD 


rtT A frrlT T — > t A rpi TOUT 

CLASSIFICATION 






81 


GOWER JC 


BIOMETRIKA 


53 


325 


66 


/"I P» T — 1 T — 1 Ti T A /"^P* T — I TIT T 

GREENACRE MJ 


rpiT TP" nmj A P"\P"\T ~T /"I A rpi THP /"I 

THEORY APPLICATION C 






84 


GUTTMAN L 


PSYCHOMETRI 


33 


469 


68 


HAND DJ 


PS ~T r~t /ITS TAffTUT A rpi "p r™l Ti T **(T A H 

DISCRIMINATION CLASS 






81 


T T A P\ rpi T/1 A TkT T A 

HARTIGAN J A 


/"■IT T Tr"* rppip TIT/1 AT /inT> T rpiTTT.iT 

CLUSTERING ALGORITHM 






75 


HENNIG W 


PHYLOGENETIC SYSTEMA 






66 


HILL MO 


A P1T\T f^rpi A rpi 

APPL STAT 


23 


340 


74 


TTTTT-lTir, T\ T 

HUBER PJ 


A T.THT r"1 rn A rpi t r~1 rpi 

ANN STATIST 


13 


435 


85 


HUBERT L 


PSP1 T Tlrt" A rpiTT 

BR J MATH S 


29 


190 


76 


HUBERT LJ 


T /"I T A f"*1 /"*( T T~l 

J CLASSIF 


2 


193 


85 


TAT TIT A 

JAIN AK 


AT (*inn T rpiT TTl iTrl /"IT TTH Til — ipt "p n T 

ALGORITHMS CLUSTERIN 






88 


t a nr\TMri tit 

JARDINE N 


MATH TAXONOMY 






71 


JOHNSON SC 


PSYCHOMETRI 


32 


241 


67 


T7"T TT/1 T — ' A /I 

KLUGE AG 


SYST ZOOL 


18 


1 


69 


KOHONEN T 


SELF ORG MAPS 






95 


T7TS T Tf~1 IT A T TT1 

KRUSKAL JB 


T. JTT TT rpi T P\ ~T H ffPlTkTTI TriHT A T fl /"I A 

MULTIDIMENSIONAL SCA 






78 


T7r\TTHT7 A T TT1 

KRUSKAL JB 


PSYCHOMETRI 


29 


1 


64 


T A TiT l~%T~* ft TiT 

LANCE GN 


COMPUTER J 


9 


373 


67 


T T~l /"I T — 1 1TTV T^» T~» T 

LEGENDRE L 


TlTT TT. T — ipi T /I A P pi /I f~\ T n/TT7 

NUMERICAL ECOLOGY 






83 


t nnn ti t 

LORR M 


rtT TTfl rpip- 1 PI 1 IT 1 T r( n /"I TAT 

CLUSTER ANAL SOCIAL 






83 


MADDISON WP 


SYST ZOOL 


33 


83 


84 


T, JT A Ti T rpi T — 1 T TIT 

MANTEL N 


i~% A HT/IPiTS r\Pin 

CANCER RES 


27 


209 


67 


Tiff ft T7T1 T~" 

MAYR E 


PRINCIPLES SYSTEMATI 






69 


T-VT/^T A /1TTT A TIT /I T 

MCLACHLAN GJ 


DISCRIMINANT ANAL ST 






92 


T.VT/'iT A /1TTT A TIT /*% T 

MCLACHLAN GJ 


T — I n jT A P TiflTST rpTTA JT PITT rpipTBT f~1 T 

EM ALGORITHM EXTENSI 






97 


T-VT/^T A /1TTT A TIT /*% T 

MCLACHLAN GJ 


TliT T ~\T rpiT TP T~l T, JT 7"^ PS P~l T r~1 T ATP>Pin 

MIXTURE MODELS INFER 






88 


MICHALSKI RS 


MACHINE LEARNING 






83 


MILLIGAN GW 


MULTIV B R 


16 


379 


81 


MILLIGAN GW 


PSYCHOMETRI 


45 


325 


80 


MILLIGAN GW 


PSYCHOMETRI 


50 


159 


85 


MURTAGH F 


COMPUT J 


26 


354 


83 


NEI M 


AM NATURAL 


106 


283 


72 


NELSON G 


SYSTEMATICS BIOGEOGR 






81 


NOSOFSKY RM 


J EXP PSY L 


10 


104 


84 


ORLOCI L 


MULTIVARIATE ANAL VE 






78 


PAVLIDIS T 


STRUCTURAL PATTERN R 






77 


PUN J G 


J MARKET RES 


20 


134 


83 


RAMMAL R 


REV M PHYS 


58 


765 


86 



17 



Tl ft HTT\ T TH JT 

RAND WM 


t ft ji/r nrn n m a 

J AM STAT A 


66 


846 


71 


REYMENT RA 


Tl ITT TT rp T T T ft T\ T A rriT — I TV If T*\ nTTnilT 

MULTIVARIATE MORPHOM 






84 


RIPLEY BD 


f-H T-\ ft m TAT fi rp A rp t fl rri T /~1 fi 

SPATIAL STATISTICS 






81 


ROHLF FJ 


MATH BIQSCI 


59 


131 


82 


SAMMON JW 


IEEE CDMPUT 


18 


401 


69 


/-*■ ft TlTT/ f"^ ~| — IT — 1 TV 

SANKOFF D 


rri -r i\ ill — I t t ft p\ T"\ f~1 f% rw-iT\ "T" TiTrt 1 I 1 \ 

TIME WARPS STRING ED 






83 


f~H ft mrpi A rpTT 

SATTATH S 


PSYCHOMETRI 


42 


319 


77 


SCHIFFMAN SS 


INTRO MULTIDIMENSION 






81 


SILVERMAN BW 


DENSITY ESTIMATION S 






86 


/-+ -r, T -I— I A rpTT T>T7 A 

SNEATH PHA 


NUMERICAL TAXONOMY P 






73 


SOKAL RR 


PRINCIPLES NUMERICAL 






63 


SPATH H 


CLUSTER ANAL ALGORIT 






80 


SPITZER RL 


BRIT J PSYCHI 


125 


341 


74 


SWOFFDRD DL 


J HEREDITY 


72 


281 


81 


TVERSKY A 


PSYCHOL REV 


84 


327 


77 


VANLAARHOVEN 


SIMULATED ANNEALING 






87 


VANRI JSBERGEN 


INFORMATION RETRIEVA 






79 


WARD JH 


J AM STAT A 


58 


236 


63 


WILEY EO 


PHYLOGENETICS 






81 


WISHART D 


CLUSTAN USER MANUAL 






87 


WOLFE JH 


MULTIV B R 


5 


329 


70 


ZAHN CT 


IEEE COMPUT 


20 


68 


71 



18 



